A Novel Spatio-Temporal-Wise Network for Action Recognition

Abstract

Action recognition is a challenging task that requires understanding the temporal relationships between frames. However, capturing and processing spatio-temporal and motion features is computationally expensive, making it difficult to apply in practical situations. We propose a novel approach called the Spatio-Temporal-Wise (STW) network to address this problem. The STW network inserts blocks, consisting of a Spatio-Temporal Fusion Module and a Temporal-Wise Module, into an existing 2D CNN. This adds very little additional computational overhead but brings huge performance improvements in recognizing human actions. The proposed method is evaluated on several public datasets, including Something-Something v1 & v2, Kinetics-400, UCF101, and HMDB51. It achieved comparable or better results on these datasets compared with state-of-the-art methods. Notably, it improves accuracy by 26.6% and 34.6% on Something-Something v1 and v2, respectively, with less than 2% overhead. The results demonstrate that the method can significantly improve action recognition tasks while requiring only a small overhead, which represents a promising direction for developing more efficient and effective approaches for handling reasoning in action recognition, and may have important applications in the future.
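
The abstract does not specify the internals of the Spatio-Temporal Fusion Module or the Temporal-Wise Module, so the following is only a rough sketch of the general idea of inserting a lightweight cross-frame block between the per-frame stages of a 2D network. The names `temporal_mix` and `run_backbone`, the fixed smoothing kernel, and the residual connection are all illustrative assumptions and are not taken from the paper.

```python
from typing import Callable, List, Sequence

Frame = List[float]        # one feature vector (C channels) for a single frame
Clip = List[Frame]         # T frames, i.e. shape [T][C]


def temporal_mix(clip: Clip, kernel: Sequence[float] = (0.25, 0.5, 0.25)) -> Clip:
    """Hypothetical temporal block (NOT the paper's STW block).

    Each channel is filtered independently along the time axis with a small
    fixed kernel (zero padding at the clip borders), and the result is added
    back to the input as a residual.
    """
    t_len = len(clip)
    half = len(kernel) // 2
    out: Clip = []
    for t in range(t_len):
        row: Frame = []
        for c in range(len(clip[t])):
            mixed = 0.0
            for k, weight in enumerate(kernel):
                s = t + k - half          # index of the neighbouring frame
                if 0 <= s < t_len:        # frames outside the clip count as zero
                    mixed += weight * clip[s][c]
            row.append(clip[t][c] + mixed)
        out.append(row)
    return out


def run_backbone(
    clip: Clip,
    stages: Sequence[Callable[[Frame], Frame]],
    temporal_block: Callable[[Clip], Clip] = temporal_mix,
) -> Clip:
    """Apply per-frame (2D-style) stages, inserting a temporal block after each."""
    feats = clip
    for stage in stages:
        feats = [stage(frame) for frame in feats]  # frame-wise processing only
        feats = temporal_block(feats)              # cross-frame information flow
    return feats


if __name__ == "__main__":
    # Toy per-frame stage standing in for a 2D CNN stage (an assumption).
    double = lambda frame: [2.0 * x for x in frame]
    print(run_backbone([[1.0], [2.0], [3.0]], [double]))
```

In this sketch the per-frame stages never see other frames; all temporal interaction comes from the inserted block, which is why such a block can be cheap relative to the backbone it is added to.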

Related resources

Deep Spatio-temporal Manifold Network for Action Recognition

Visual data such as videos are often sampled from a complex manifold. We propose leveraging the manifold structure to constrain deep action feature learning, thereby minimizing intra-class variations in the feature space and alleviating the over-fitting problem. Considering that the manifold can be transferred, layer by layer, from the data domain to the deep features, the manifold priori is ...

Spatio-temporal SURF for Human Action Recognition

In this paper, we propose a new spatio-temporal descriptor called ST-SURF. It is based on a novel combination of speeded-up robust features (SURF) and optical flow. The Hessian detector is employed to find all interest points. To reduce the computation time, we propose a new methodology for segmenting video into Frame Packets (FPs), based on interest point trajectory tracking. W...

Human Action Recognition Using Spatio-temporal Classification

This paper proposes a framework, “Temporal-Vector Trajectory Learning” (TVTL), for human action recognition. The central idea of the framework is to add temporal information to the action recognition process. To this end, three kinds of temporal information, LTM, DTM, and TTM, are proposed. With the three kinds of proposed temporal informatio...

Spatio-temporal Aware Non-negative Component Representation for Action Recognition

This paper presents a novel mid-level representation for action recognition, named spatio-temporal aware non-negative component representation (STANNCR). The proposed STANNCR is based on action components and incorporates spatial-temporal information. We first introduce a spatial-temporal distribution vector (STDV) to model the distributions of local feature locations in a compact and discri...

Improved Spatio-temporal Salient Feature Detection for Action Recognition

Spatio-temporal salient features localize local motion events and are used to represent video sequences for many computer vision tasks such as action recognition. The robust detection of these features under geometric variations such as affine transformation and view/scale changes is, however, an open problem. Existing methods use the same filter for both time and space and hence perform an ...

Journal

Journal title: IEEE Access

Year: 2023

ISSN: 2169-3536

DOI: https://doi.org/10.1109/access.2023.3274542